21 research outputs found

    Integration of the 3D Environment for UAV Onboard Visual Object Tracking

    Full text link
    Single visual object tracking from an unmanned aerial vehicle (UAV) poses fundamental challenges such as object occlusion, small-scale objects, background clutter, and abrupt camera motion. To tackle these difficulties, we propose to integrate the 3D structure of the observed scene into a detection-by-tracking algorithm. We introduce a pipeline that combines a model-free visual object tracker, a sparse 3D reconstruction, and a state estimator. The 3D reconstruction of the scene is computed with an image-based Structure-from-Motion (SfM) component that enables us to leverage a state estimator in the corresponding 3D scene during tracking. By representing the position of the target in 3D space rather than in image space, we stabilize the tracking during ego-motion and improve the handling of occlusions, background clutter, and small-scale objects. We evaluated our approach on prototypical image sequences, captured from a UAV with low-altitude oblique views. For this purpose, we adapted an existing dataset for visual object tracking and reconstructed the observed scene in 3D. The experimental results demonstrate that the proposed approach outperforms methods using plain visual cues as well as approaches leveraging image-space-based state estimations. We believe that our approach can be beneficial for traffic monitoring, video surveillance, and navigation.Comment: Accepted in MDPI Journal of Applied Science

    READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation

    Full text link
    We present READMem (Robust Embedding Association for a Diverse Memory), a modular framework for semi-automatic video object segmentation (sVOS) methods designed to handle unconstrained videos. Contemporary sVOS works typically aggregate video frames in an ever-expanding memory, demanding high hardware resources for long-term applications. To mitigate memory requirements and prevent near object duplicates (caused by information of adjacent frames), previous methods introduce a hyper-parameter that controls the frequency of frames eligible to be stored. This parameter has to be adjusted according to concrete video properties (such as rapidity of appearance changes and video length) and does not generalize well. Instead, we integrate the embedding of a new frame into the memory only if it increases the diversity of the memory content. Furthermore, we propose a robust association of the embeddings stored in the memory with query embeddings during the update process. Our approach avoids the accumulation of redundant data, allowing us in return, to restrict the memory size and prevent extreme memory demands in long videos. We extend popular sVOS baselines with READMem, which previously showed limited performance on long videos. Our approach achieves competitive results on the Long-time Video dataset (LV1) while not hindering performance on short sequences. Our code is publicly available.Comment: Accepted to BMVC 2023. Code @ https://github.com/Vujas-Eteph/READMe

    Estimating the precision of under-water video-mosaics

    No full text
    Under-water video-mosaics are an important tool e.g. for inspection of man-made infrastructure. Cameras may drift in rotation and distance to the surface while the mosaic will often be very much larger than a single frame resulting in long chains of planar homographies. This contribution addresses the problems arising from dead-reckoning drift in such chains. Local patches are rectified using homography decomposition. Experiments with a portal allowing underwater image sequences with mechanical groundtruth are performed. Thus the deviation of the mosaic from a true orthographic map may be estimated

    Designing observer trials for image fusion experiments with Latin Squares

    No full text
    Multisensor image fusion (e.g. IR with visual) is the process of combining relevant information from two or more images into a single image. The aim is to find an objective quality measure, which can be used in automatic applications, that correlates best with subjective observer trials. Not all combinations of image, algorithm, and test observer can be worked out. In this paper R. Fisher’s Design of Experiments approach based on Latin Squares is used for thinning out the number of experiments for each observer in such observer trials while preserving exactness and reliability of the result

    Simulation of the interaction of a high energy laser beam with the sea surface in the short wavelength infrared

    No full text
    The knowledge of the interaction of a high energy laser beam with a dynamic sea surface is of great practical interest in maritime environments. The components transmitted into the sea and reflected at the sea surface have to be considered. The calculation of energy transfer into the sea is fundamental to the prediction of upper-ocean heating and temperature-dependent optical properties of the sea, which in turn influence its reflectance characteristics. In addition, the spatial energy (or power) distribution of the laser beam reflected at the dynamic sea surface is also of high significance. For the estimation of the laser light energy reflected into a specific spatial direction, several parameters need to be considered, e.g., wind speed, wind direction, and fetch. The calculated amount of light energy reflected into a specific direction varies statistically and depends largely on the dynamics of the wavy sea surface. A 3D simulation of a dynamic sea surface is presented interacting with a high energy laser beam in the short wavelength infrared spectral band. The simulation computes the upper-ocean heating, the temperature-dependent Fresnel reflectances, and the absorption in seawater considering the laser geometric configuration. For the reflectance calculations, a bistatic configuration of the laser source and receiver is regarded, where the receiver positions are on a virtual hemisphere having the laser spot center as the center point. The specular reflection of the laser beam at the sea surface is modeled by an analytical statistical bidirectional reflectance distribution function of the sea surface. The simulation is restricted to sea surfaces heated to the boiling point to avoid complex phase transition effects between water and gas. For a high energy laser beam focused on a small laser spot on the evolving wavy sea surface, the maximum expected reflected laser power is calculated for the specular forward- and back-reflection direction for glints. The probability of occurrence and temporal occurrence of those glint events is estimated for both directions

    Comparing visual tracker fusion on thermal image sequences

    No full text
    Visual object tracking is a challenging task in computer vision, especially if there are no constraints to the scenario and the objects are arbitrary. The number of tracking algorithms is very large and all have diverse advantages and disadvantages. Normally they show various behaviour and their failures in the tracking process occur at different moments in the sequence. So far, there is no tracker which can solve all scenarios robustly and accurately. One possible approach to this problem is using a whole collection of tracking algorithms and fusing them. There exist various strategies to fuse tracking algorithms. In some of them only the resulting outputs are fused. This means that new algorithms can be integrated with less effort. This fusion can be called "high-level" because the tracking algorithms only interact through the last step in their procedure. Three fusion methods are investigated. They are called Weighted mean fusion, MAD fusion and attraction field fusion. In order to evaluate the three different approaches a collection of thermal image sequences has been investigated. These sequences show maritime scenarios with various objects such as ships and other vessels
    corecore